Negation and Speculation Identification in Chinese Language
نویسندگان
چکیده
Identifying negative or speculative narrative fragments from fact is crucial for natural language processing (NLP) applications. Previous studies on negation and speculation identification in Chinese language suffers much from two problems: corpus scarcity and the bottleneck in fundamental Chinese information processing. To resolve these problems, this paper constructs a Chinese corpus which consists of three sub-corpora from different resources. In order to detect the negative and speculative cues, a sequence labeling model is proposed. Moreover, a bilingual cue expansion method is proposed to increase the coverage in cue detection. In addition, this paper presents a new syntactic structure-based framework to identify the linguistic scope of a cue, instead of the traditional chunking-based framework. Experimental results justify the usefulness of our Chinese corpus and the appropriateness of our syntactic structure-based framework which obtained significant improvement over the stateof-the-art on negation and speculation identification in Chinese language. *
منابع مشابه
Negation and Speculation Target Identification
Negation and speculation are common in natural language text. Many applications, such as biomedical text mining and clinical information extraction, seek to distinguish positive/factual objects from negative/speculative ones (i.e., to determine what is negated or speculated) in biomedical texts. This paper proposes a novel task, called negation and speculation target identification, to identify...
متن کاملLinguistic scope-based and biological event-based speculation and negation annotations in the Genia Event and BioScope corpora
Background: The treatment of negation and hedging in natural language processing has received much interest recently, especially in the biomedical domain. However, open access corpora annotated for negation and/or speculation are hardly available for training and testing applications, and even if they are, they sometimes follow different design principles. In this paper, the annotation principl...
متن کاملA Unified Framework for Scope Learning via Simplified Shallow Semantic Parsing
s 94.99 94.35 94.67 Papers 90.48 87.47 88.95 Negation cue recognition Clinical 86.81 88.54 87.67 Abstracts 83.74 93.14 88.19 Papers 73.02 82.31 77.39 Speculation cue recognition Clinical 33.33 91.77 48.90s 83.74 93.14 88.19 Papers 73.02 82.31 77.39 Speculation cue recognition Clinical 33.33 91.77 48.90 Table 9: Performance of automatic cue recognition with automatic parse trees on the three sub...
متن کاملLinguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora
BACKGROUND The treatment of negation and hedging in natural language processing has received much interest recently, especially in the biomedical domain. However, open access corpora annotated for negation and/or speculation are hardly available for training and testing applications, and even if they are, they sometimes follow different design principles. In this paper, the annotation principle...
متن کاملDetecting Negated and Uncertain Information in Biomedical and Review Texts
The thesis proposed here intends to assist Natural Language Processing tasks through the negation and speculation detection. We are focusing on the biomedical and review domain in which it has been proven that the treatment of these language forms helps to improve the performance of the main task. In the biomedical domain, the existence of a corpus annotated for negation, speculation and their ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015